Designing better frog call recognition models
نویسندگان
چکیده
Advances in bioacoustic technology, such as the use of automatic recording devices, allow wildlife monitoring at large spatial scales. However, such technology can produce enormous amounts of audio data that must be processed and analyzed. One potential solution to this problem is the use of automated sound recognition tools, but we lack a general framework for developing and validating these tools. Recognizers are computer models of an animal sound assembled from "training data" (i.e., actual samples of vocalizations). The settings of variables used to create recognizers can impact performance, and the use of different settings can result in large differences in error rates that can be exploited for different monitoring objectives. We used Song Scope (Wildlife Acoustics Inc.) to build recognizers and vocalizations of the wood frog (Lithobates sylvaticus) to test how different settings and amounts of training data influence recognizer performance. Performance was evaluated using precision (the probability of a recognizer match being a true match) and sensitivity (the proportion of vocalizations detected) based on a receiver operating characteristic (ROC) curve-determined score threshold. Evaluations were conducted using recordings not used to build the recognizer. Wood frog recognizer performance was sensitive to setting changes in four out of nine variables, and small improvements were achieved by using additional training data from different sites and from the same recording, but not from different recordings from the same site. Overall, the effect of changes to variable settings was much greater than the effect of increasing training data. Additionally, by testing the performance of the recognizer on vocalizations not used to build the recognizer, we discovered that Type I error rates appear idiosyncratic and do not recommend extrapolation from training to new data, whereas Type II errors showed more consistency and extrapolation can be justified. Optimizing variable settings on independent recordings led to a better match between recognizer performance and monitoring objectives. We provide general recommendations for application of this methodology with other species and make some suggestions for improvements.
منابع مشابه
Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملFemale preferences for temporal order of call components in the túngara frog: a Bayesian analysis.
We employed a Bayesian statistical approach to examine female preferences in the Neotropical frog Physalaemus pustulosus for the temporal relationship of the two parts of the conspecific advertisement call. The male advertisement call consists of a 'whine', which is necessary for species recognition, followed immediately by one or more 'chucks', which make the whine more attractive to females. ...
متن کاملDecoupled Evolution between Senders and Receivers in the Neotropical Allobates femoralis Frog Complex
During acoustic communication, an audible message is transmitted from a sender to a receiver, often producing changes in behavior. In a system where evolutionary changes of the sender do not result in a concomitant adjustment in the receiver, communication and species recognition could fail. However, the possibility of an evolutionary decoupling between sender and receiver has rarely been studi...
متن کاملAuditory selectivity for acoustic features that confer species recognition in the tungara frog.
In anurans, recognition of species-specific acoustic signals is essential to finding a mate. In many species, behavioral tests have elucidated which acoustic features contribute to species recognition, but the mechanisms by which the brain encodes these species-specific signal components are less well understood. The túngara frog produces a `whine' mating call that is characterized by a descend...
متن کاملTúngara frogs
Why are they called túngara frogs? As in most frog species, males call to attract females. Unlike most other frog species, túngara frogs produce both simple and complex calls. A simple call consists of a frequencymodulated sweep called a ‘whine’, while a complex call is a whine plus one to six or seven broadband ‘chucks’. The name ‘túngara’ frog is onomatopoeically derived from the male’s call:...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 7 شماره
صفحات -
تاریخ انتشار 2017